Metabolism and evolution of Haemophilus influenzae deduced from a whole-genome comparison with Escherichia coli
نویسندگان
چکیده
BACKGROUND The 1.83 Megabase (Mb) sequence of the Haemophilus influenzae chromosome, the first completed genome sequence of a cellular life form, has been recently reported. Approximately 75 % of the 4.7 Mb genome sequence of Escherichia coli is also available. The life styles of the two bacteria are very different - H. influenzae is an obligate parasite that lives in human upper respiratory mucosa and can be cultivated only on rich media, whereas E. coli is a saprophyte that can grow on minimal media. A detailed comparison of the protein products encoded by these two genomes is expected to provide valuable insights into bacterial cell physiology and genome evolution. RESULTS We describe the results of computer analysis of the amino-acid sequences of 1703 putative proteins encoded by the complete genome of H. influenzae. We detected sequence similarity to proteins in current databases for 92 % of the H. influenzae protein sequences, and at least a general functional prediction was possible for 83 %. A comparison of the H. influenzae protein sequences with those of 3010 proteins encoded by the sequenced 75 % of the E. coli genome revealed 1128 pairs of apparent orthologs, with an average of 59 % identity. In contrast to the high similarity between orthologs, the genome organization and the functional repertoire of genes in the two bacteria were remarkably different. The smaller genome size of H. influenzae is explained, to a large extent, by a reduction in the number of paralogous genes. There was no long range colinearity between the E. coli and H. influenzae gene orders, but over 70 % of the orthologous genes were found in short conserved strings, only about half of which were operons in E. coli. Superposition of the H. influenzae enzyme repertoire upon the known E. coli metabolic pathways allowed us to reconstruct similar and alternative pathways in H. influenzae and provides an explanation for the known nutritional requirements. CONCLUSIONS By comparing proteins encoded by the two bacterial genomes, we have shown that extensive gene shuffling and variation in the extent of gene paralogy are major trends in bacterial evolution; this comparison has also allowed us to deduce crucial aspects of the largely uncharacterized metabolism of H. influenzae.
منابع مشابه
Escherichia coli katE gene product . Haemophilus influenzae is homologous to the A peroxide / ascorbate - inducible catalase
Escherichia coli katE gene product. Haemophilus influenzae is homologous to the A peroxide/ascorbate-inducible catalase from
متن کاملAnalysis of codon usage patterns of bacterial genomes using the self-organizing map.
Codon usage varies both between organisms and between different genes in the same organism. This observation has been used as a basis for earlier work in identifying highly expressed and horizontally transferred genes in Escherichia coli. In this work, we applied Kohonen's self-organizing map to analysis of the codon usage pattern of the Escherichia coli, Aquifex aeolicus, Archaeoglobus fulgidu...
متن کاملIn Silico Metabolic Model and Protein Expression of Haemophilus influenzae Strain Rd KW20 in Rich Medium.
The intermediary metabolism of Haemophilus influenzae strain Rd KW20 was studied by a combination of protein expression analysis using a recently developed direct proteomics approach, mutational analysis, and mathematical modeling. Special emphasis was placed on carbon utilization, sugar fermentation, TCA cycle, and electron transport of H. influenzae cells grown microaerobically and anaerobica...
متن کاملTranscription of foreign DNA in Escherichia coli.
Propagation of heterologous DNA in E. coli host cells is central to molecular biology. DNA constructs are often engineered for expression of recombinant protein in E. coli, but the extent of incidental transcription arising from natural regulatory sequences in cloned DNA remains underexplored. Here, we have used programmable microarrays and RT-PCR to measure, comprehensively, the transcription ...
متن کاملExpression in Escherichia coli
The gene for protein D, a membrane-associated protein with specific affinity for human immunoglobulin D, was cloned from a nontypeable strain of Haemophilus influenzae. The gene was expressed in Escherichia coli from an endogenous promoter, and the gene product has an apparent molecular weight equal to that of H. influenzae protein D (42,000). The complete nucleotide sequence of the gene for pr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Current Biology
دوره 6 شماره
صفحات -
تاریخ انتشار 1996